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Under equilibrium conditions, the sample average of the delays en- 
countered by all the calls submitted during a given time interval is an 
unbiased estimate of the mean of the delay distribution. If some of the 
delays are not observed, the resulting sample average need no longer be 
an unbiased estimator of the corresponding population mean. This is the 
case when, for instance, only a limited number of delays can be timed 
simultaneously. The purpose of this paper is to investigate these biases 
for queuing systems when only one clock is available and thus one delay 
only can be measured at a time. It is shown that, regardless of the order 
of service, the expected value of the observed average delays is always 
smaller than the mean waiting time for all calls. 

Although the average delay on all calls is independent of the order of 
service, the measurement biases resulting when only one delay can be 
measured at once depend on the queue discipline. In particular, we shall 
show that the average delay for all calls is always larger than the average 
delay of the observed calls even if these calls are always served last (ob- 
served-call served-last) . 

I. INTRODUCTION 

The following remarks due to J. F. C. Kingman appear in the 
Proceedings of the Symposium on Congestion Theory held at the Uni- 
versity of North Carolina in 1964 (Ref. 1, pp. 314-315) : "To illustrate 
the pitfalls of inference from congestion systems, let me tell a (more 
or less true) story. It was desired to estimate the mean waiting time 
in a particular queuing system, and for technical reasons, only one 
customer could be timed at once. Thus the waiting time a>i of a customer 
was measured. When he entered service, the next customer to arrive 
was observed and his waiting time 0J2 was noted. This procedure 
continued, the waiting times o> 3 , on, • • • being measured, and, for 
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large n, 

tt -I (wi + W2 + • • • + ^„) 

was used as an estimate of the waiting time. It is, however, strongly 
biased and inconsistent, because of the selection of the customers to be 
observed. The mean waiting time is overestimated by a factor which 
becomes arbitrarily large as the traffic intensity approaches one." We 
stress that, according to this sampling procedure, a customer is observed 
if and only if it arrives when the clock is free. 

Another instance of biases induced by the measurement procedure 
is reported by Oberer and Riesz. 2 These authors have investigated the 
possibility of estimating blocking probabilities in telephone networks 
by means of test calls generated by a single source repeatedly calling 
a dedicated number. Their study shows that the proportion of blocked 
test-calls does not yield a suitable estimate of the grade of service as 
it is markedly biased downwards. As expected the bias becomes larger 
as the intervals between consecutive (nonoverlapping) test-calls 
becomes smaller. It is also established in Ref. 2 that the relative 
test-call biases increase as the blocking probability decreases. 

It is worth noting that the biases studied in Ref. 2, as well as here, 
are of a different sign than those referred to by Kingman. Much more 
important, however, is the fact that measurement techniques which, 
superficially, appear to be adequate may prove to be very unreliable. 
It is thus becoming increasingly clear that great care is required in the 
design of performance measurements for stochastic service systems 
so that unanticipated biases are not encountered. 

The purpose of this paper is to investigate the effects of partial 
sampling on the estimate of the mean (overall) waiting time obtained 
by averaging measured delays. The biases induced by such limitations 
will be studied here for M/G/l and GI/M/s when at most one call 
can be observed at once and the estimation procedure is as described 
by Kingman. We shall see that, in these systems, the equilibrium 
average delay of the observed calls is always smaller than the equi- 
librium average delay for all calls. 

It is well known that the average delay for all calls is the same for 
all queue disciplines which are independent of the lengths of the 
individual calls (no other type of queue disciplines will be considered 
here). As we shall see, this is not true of the mean measured delay when 
only one delay can be recorded at a time. In this case, both the un- 
conditional and the conditional average delays* are (as expected) 

t As customary, unconditional and conditional delays pertain to arbitrary and 
delayed calls respectively. 
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smallest when the observed calls are served first and largest when they 
are served last. (In particular, the second extreme case occurs in 
systems with first-come last-served queue discipline.) Furthermore, 
in view of the general inequality mentioned at the end of the preceding 
paragraph, the upper bound for the unconditional average delay of 
the observed calls (which is reached when the observed calls are served 
last) is always a strict lower bound for the unconditional average delay 
for all calls ! 

The preceding result pertains to unconditional delays and does not 
always hold for the average delay of those sampled calls which en- 
counter a delay. Thus for M/M/s, the conditional average delay for 
all delayed calls is equal to the conditional average delay of the ob- 
served delayed calls so long as these are always served last. (Note that 
for M/M/s, the average delay of the delayed calls is equal to the 
average length of the busy period, and that the waiting-time distri- 
bution of the observed calls coincides with the busy-period distribution 
for the observed-served-last measurement procedure.) In contrast, 
for the M/r*/l queue, the upper bound for the conditional average 
delay of the observed delay calls is larger than the average conditional 
delay for all delayed calls when k > 1, the inequality being reversed 
whenever < k < 1. (I\ is used here to designate the gamma distri- 
bution with mean 1 and variance kr 1 . Thus M/Ti/s is identical to 
M/M/s. When k is an integer, T k is the Erlangian distribution often 
designated E*.) 

Expressions for the moments of the equilibrium delay-distribution of 
the observed calls are given for M/G/l and first-come first-served 
queue discipline. The equilibrium delay-distribution of the observed 
calls is also derived for M/M/s with order-of-arrival service. (Cor- 
responding results for the "observed-call served-first" and the "ob- 
served-call served-last" measurement procedures are immediate.) These 
formulas are used to show that the biases induced by partial sampling 
can be quite substantial. 

When the average service-time is unity, an assumption made 
throughout, the average delay, EW, for the single-server queue 
M/G/l is given by the formula (Ref. 3, pp. 46-50) : 

EW = am 2 /2(l - a), 

where a is the server occupancy and m 2 is the second moment about 
of the service-time distribution. Since m 2 can be arbitrarily large, no 
bound can be placed on the value of EW. But when only one delay 
can be timed at once, we shall see that the expectation of the observed 
delays cannot exceed 1/(1 — a). Therefore for any prescribed value 
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of a, it is always possible to find service-time distributions for which 
the ratio of the average delay for all calls to the average delay of the 
observed calls exceeds any given bound. 

To simplify the exposition we restrict ourselves to full-access delay 
systems with recurrent inputs in which delays are measured by means 
of a single clock. Some of the results obtained below can, however, 
be extended to more general situations. 

(In the sequel, W, with or without affix, is used to designate the 
waiting time of an arbitrary call while W*, with or without affix, is 
used as the generic symbol for the observed delays when only one 
clock is available.) 

II. A GENERAL DELAY FORMULA 

Consider the queuing system GI/G/s and suppose that the arrival 
and service-time distributions are such that equilibrium can be 
reached. (To avoid trivial qualifications we assume throughout that 
the mean interarrival time is finite. For the same reason, the under- 
lying distributions are also supposed to be such that simultaneous 
occurrences of events need not be considered.) The purpose of this 
section is to derive a formula relating the average delay of the observed 
calls to the equilibrium probability, «S>, that an observed call has 
immediate access to a server. To this end we prove first that 

$ = (1 - B)(l + A)/[(l - 5)(1 + A) + 5], (1) 

where B is the equilibrium blocking probability for all calls and A is 
the expectation of the number of unobserved calls originating during 
the waiting time of an arbitrary observed delayed call. 

It follows from (1) that the probability that an observed call is 
blocked is always (strictly) smaller than the overall probability of 
delay (so long as B ?* or B ^ 1, two trivial cases that we exclude 
from our considerations). This, of course, is a consequence of the fact 
that all nonblocked calls are observed whereas, with one clock only, 
some delays may not be recorded. 

We turn now to the proof of (1). Consider an infinite sequence of 
consecutive calls and for the ith call (i = 1,2, • • •) let 



X,= 



r if the ith call is delayed, 
[1 if the ith call is not delayed, 

f if the ith call is not observed, 
Yi ; = J if the ith call is observed and not delayed, 
1 if the ith call is observed and delayed. 
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Let e > 0. Then assuming that the system is in equilibrium when 
the first call arrives, we have, by the integral stationarity theorem 
(Ref. 4, p. 419), 

i. X\ +• • •+ X n EX\ 



n— » " 


(Xi 


;+ Y t + 


*)+•■■ 


+ (Xn + Y n + e) 


and 
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• • • + (X. + Y n ) 



B(Xi + Fi) + e 
*<Xi + Yx) 



(2) 



(3) 



(Xi + «) + ■ ■ ' + (I. + e) £X X + e ' 

with probability 1. However, since (Ref. 4, p. 421) 

lim Il+, " +I " = EXi = Pr [X! = 1] > 0, 

n— » oo n 

with probability 1, there is, for almost all realizations of the process, 
an integer n such that the ratios 



(Xi + Yi) + • • • + (x m + r m ) ' 

are well defined. Hence, by (2) and (3), we have 
EX 1 < lim X1+--.+ X, 



fl(X, + F x ) 4- e - „" « (Xi + Fi) + • • • + (X„ + F B ) 

^ BX, + e 



= ^(Xi + Ft) ' 
with probability 1 and, letting e tend to 0, 

.. X!+---+X n gXi m 

™ (Xi + Fi) + • ■ ■ + (X„ + F„) E(Xt + Fi) ' ^ ; 

with probability 1. 

[The preceding derivation makes use of the fact that the station- 
arity — and hence the integral stationarity — of the processes (X,-, 
i = 1, • • ■ j and [X< + F,-, i = 1, • • • } follows from the property that 
the random variables Xi and I 7 ,-, i ' = 1, • • • , whose means are finite, are 
"translates" defined on the stationarity queuing process (Ref. 4, 
p. 417 ff.). 

[The formulas in Ref. 4, p. 419, Theorem A, involve conditional 
expectations with respect to fields of invariant events. Under the 
present circumstances these expressions can be simplified. Indeed 
let us specify the state of the system, T t at time /, by means of the 
vector whose components are the arrival time of the last request 
placed before t and the elapsed portions of the service-times in progress 
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at time t. Then the only invariant sets (Refs. 4 and 5) of the process 
{ T t — °° < t < oo } are the whole space and the null-set. This 
property, in turn, implies that the conditional expectations of the 
random variables X,- and X< + F< relative to their invariant fields 
can be replaced by the unconditional expectations EX> and E(X { 4- 7,), 
respectively. These and other similar substitutions are made here 
without formal justification.] 

Consider now an infinite sequence of observed calls and let 

JO if the t'th observed call is delayed, 
i ~~ 1 1 if the t'th observed call is not delayed. 

Then (Ref. 4, p. 421) 

]im Zy + Z 2 +---+Z n = EZi = $ (5) 

n-»°o W 

Since (4) and (5) are both equal to the proportion of observed calls 
with zero delay over an interval of infinite length, we have 

* = EXx/EiXr + 7i). (6) 

We note that EXi is equal to the probability that a call (observed 
or not) is not delayed. Hence 

EXi = 1 - B, (7) 

and to complete the proof of (1) we have to show that 

EYt = B/{l+A). (8) 

To this end consider again a stationary sequence of observed calls and 
let Ai be the number of unobserved calls placed during the waiting 
time of the ith call that is both observed and delayed. Then we have 
(Ref. 4, pp. 419-421) 

,. n 1 _ 1 ( q\ 

™ n + ^+.-.+ A. TTEA! 1+A> w 

with probability 1. 

Furthermore, by the integral stationarity theorem (Ref. 4, p. 419) 
and a simple «-argument of the type used in the proof of (2), we have : 

with probabihty 1. 
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Since the left-hand sides of (9) and (10) are both equal to the propor- 
tion of delayed calls that are observed, we have 

EY 1 = E(l - X x )/(1 + A) = B/(l + A). 

This completes the proof of (1). 

Our next step will now be to relate A to the average delay, EW+, 
of the observed calls. Let W iif be the delay of the z'th observed call 
and let U, be the interval between the end of the ith and the beginning 
of the (i + l)st measurement. Let also /„ be the interval between 
the arrival epochs of the nth and (n + l)st call (in the whole sequence 
of calls, observed or not) . Then we have : 

(TFi* + Ui) + •■• + (ir n+ + U n ) = h + - -+lK ni (11) 

where K„, a random variable, is equal to the number of calls placed 
during the interval that starts with an observed call and ends just 
before the beginning of the {n + l)st measurement. By the stationarity 
theorem, we have (Ref. 4, p. 421) : 



Urn (^u+^+-+(^+^ _ E{W ^ + Ul)f 



(12) 



with probability 1, and, by the strong law of large numbers (note 
that the /„'s are, by assumption, independent random variables with 
finite means and that K n ^ n), 

lim *i + / » +•••+**■ = a -., (13) 

n— » oo li-n 

with probability 1, where a -1 is the expected interarrival-time. 
Furthermore, 

K n = [ZH-(1 -Z x ){\ +i4i)]+-"+ [Z n + (1 - Z n )(l +A n )2, 

so that 

lim — " = E[Z X + (1 - Zx)(l + A x )~\ = * + (1 - *)U + A), (14) 

n _oo n 

with probability 1. 

Combining (1 1)— (14) we find that 

«E{W H + l\) = 1 + A(l - *). (15) 

In particular, when the input is Poissonian, EJJ\ = or 1 and (15) 
reduces to 

aEW^ = A(l - $). (16) 
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Thus, taking (1) into account, we find that : 

EW* = EW U = (* + B - 1)M1 - B). (17) 

It should be noted that the preceding relation is valid regardless of 
the order of service. 

When the calls are served in order of arrival, (16) is an immediate 
consequence of the fact that the waiting time of any given call is not 
affected by the stream of requests placed after its arrival epoch. This 
is also true when the observed calls are always served first and (16) 
can then be written down with equal ease. 

We note that (17) can be obtained quickly whenever the epochs 
at which measurements begin or terminate constitute a renewal 
process. In such cases, the expected number of observations in time t 
is (asymptotically) 

(EW* + a" 1 ) -1 -* + 0(1). * lar S e > ( 18 ) 

and the expected number of arrival points at which the system is 
empty in time t is 

a(l - B)t + 0(1),* large. (19) 

The long-term proportion of observed calls with no delay is given by 
the ratio of (19) to (18) with probability 1 (cf. Ref. 6, p. 264, alter- 
native form of Theorem IV) : 

«(1 -B)(EW* + or 1 ). (20) 

Since the probability $ that an arbitrary call is not delayed is in- 
dependent of the past, a simple application of the strong law of large 
numbers show that (20) may be equated to $ and (17) therefore holds. 
An instance where the preceding argument can be applied is the 
M/G/l system with observed calls always served last. With this 
order of service, there is exactly one call in the system at the termina- 
tion of each measurement. These epochs constitute a renewal process 
since they also coincide with the beginnings of the service-times of 
the observed calls. With an obvious change, the previous argument 
remains true for M/M/s with observed-calls served-last. 

III. TWO EXTREME CASES 

In this section we show that if only one clock is available then the 
expected average delay of the observed calls is largest when the 
observed calls are served last and smallest when the observed calls 
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are served first. These relations are not statistical : They are satisfied 
by all the realizations of the process over any finite or infinite time 
interval regardless of the arrival and service-time distributions. (To 
avoid ambiguities, we assume that the timing device is free at the 
beginning of the realizations.) 

We note first that under any measurement procedure, all the calls 
which arrive when all the servers are busy, but no request is waiting, 
are observed. These calls are the only delayed calls that are observed 
when the observed calls are served last. Therefore (i) the number of 
observed delayed calls takes its smallest value for the observed-served- 
last procedure and (it) during the measurement of a delay, D, under 
this particular procedure any observed delay under any alternate 
single-clock measurement procedure cannot exceed D. Combining these 
two facts and taking into account that all calls with zero delay are 
observed we may conclude that the observed average delay takes 
always its largest value when the observed calls are served last, as is 
the case for the first-come last-served queue discipline. 

When the observed calls are served first, we note that (i) the number 
of observed delays over any busy period (initial or not) is never smaller 
than for any other single-clock measurement procedure and (it) to 
each observed delayed call there corresponds, under any other single- 
clock measurement procedure, one call whose delay is at least as 
large and this correspondence involves all the observed delays under 
the alternate procedure. All calls with zero delay are again observed 
and the average delay of the observed calls takes therefore its smallest 
value when the observed calls are served first. 

Clearly the conditional average delays of the observed calls do have 
the same property. 

iv. Bounds for the average delay of the observed calls in 

M/G/l 

The object of this section is to determine the upper and lower 
bounds for the average delay, EW+, of the observed calls in M/G/l. 
These bounds, as we have seen, are reached when the observed calls 
are served last and first respectively. Under the present conditions, 
formula (17) may be written as follows: 

EW* = (# + a - l)/a(l - a). (21) 

For a given server occupancy a, EW+ is a monotone increasing function 
of * = *(«). Since * < 1, (21) implies that EW* < (1 - a). Hence 
for a < lfEW+ia always bounded (but EW is not). 
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It will be convenient to define here the service backlog, at a given 
instant t, as the sum of the service-times of all waiting requests plus 
the residual of the service-time of the request being served. [When 
calls are served in order of arrival, the service backlog is equal to the 
virtual waiting time (Ref. 3, p. 59 ff.).] 

Now let F(-) be the stationary cumulative distribution of the 
service backlog at the end of a measurement. The probability, $(a), 
that an observed call does not suffer a delay is simply the Laplace- 
Stieltjes transform of F( ■ ) evaluated at a since it is equal to the proba- 
bility that no call originates during a time interval whose length is 
that of the service backlog: 



$ 



(a) = r e- at dF(t). 
Jo 



Writing o-(-) for the Laplace-Stieltjes transform of the service-time 
distribution we have the following inequality : 

$(a) ^ a (a). (22) 

This inequality is a consequence of the fact that, at the conclusion of a 
measurement, the service backlog may be represented as the sum of 
two independent random variables, one of which is the full service-time 
of the request whose delay has just come to an end while the other is 
equal to the sum of the service-times of all the waiting requests. 
Writing R(-) for the c.d.f. of the latter and S(-) for the service-time 
distribution, we have : 



<*>(„) = f° e -«t d f l R ( t _ v )dS(v) 
Jo Jo 

= a (a) r e-* l dR(t) £*(<*). 
Jo 



When the observed calls are served last, 

_ fl for t ^ 0, 
aw |0 for t < 0, 

and 

$(a) = <r(a). 

We can therefore conclude that 



EW^ 'M*"- 1 - (23) 

* a(l — a) 
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We are now in a position to prove that the average delay, EW+, is 
always smaller than the average delay for all calls (observed or not). 
Since the service-time is unity, we have: 



*(«) ^ «r(«) = r e- at dS(t) < 1 - a + 
Jo 



er/«2 



where m 2 is the second moment of S about the origin. Hence, substi- 
tuting 1 - a + ahn.t/2 for a (a) in (23) we find that, irrespective of the 
service order : 

EW * < W=T) - £,F - < 24 > 

By (23) and the equality in (24) we also have : 
EW/EW* > °^ , 

a 

so that, for any given a, we can always find a service-time distribution 
such that EW/EW+ exceeds any preassigned value. 

We now derive an absolute lower bound for EW+. As shown above, 
this bound can be found by assuming that the observed calls are 
served first. Our first step here is to determine A. 

For the observed-served-first procedure, the circumstances under 
which a positive delay can be observed are as follows : at some time 
the clock is not in use and a service-time begins, and during this service- 
time a new call arrives. Thus A is the conditional expectation of the 
number of arrivals minus 1 during an arbitrary service-time given that 
at least one call is placed during a service- time. A, therefore, is given 
by the formula 

A = r t (« - l)^V-'dS(fl / r (1 - <r«<)dS(t) 

J n = i n ■ /Jo 

= J* [at - 1 + e- at ~\dS{t) / T (1 - e-**)dS{t) 

= [a - 1 + <r(a)]/[l - „(«)]. 

By means of (1) and (21), it is now readily shown that, for the ob- 
served-calls-served-first procedure : 

*(«) - o - * 



2 — a — a {a) 
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and 

FW *(«) + " - ? 

Summing up, we have the following inequalities for * and EW+, 
regardless of the measurement procedure : 



1 - 



^-r\ ^ *(«) ^ *(«)> ( 25 ) 



2 - a - <r(a) 
and 

,(«)+« -1 < £|F ^ *(«) + « - 1 . (26) 

a[2 - a - <x(a)l ~ * a(l - a) 

For exponential service-times, (25) and (26) reduce to 

(1 - a 2 )/(l + a - a 2 ) ^ *i(a) £ 1/(1 + a), 
a/(l + a - a 2 ) ^ £TF*i ^ a/(l - « 2 )- 

(The subscript 1 is added to $ and #TF + to indicate that the service- 
times are exponentially distributed.) 

V. THE SINGLE-SERVER QUEUE M/G/l WITH ORDER-OF-ARRIVAL 
SERVICE 

In this section we consider the M/G/l queue under the assumption 
that the calls are served in order of arrival. Our principal aim here is 
to determine * = *(«) and then, by means of (17), EW*. To this end 
let p„ be the probability that there are n calls in the system immediately 
after the conclusion of a measurement. Then, relating the state proba- 
bilities at two consecutive conclusions of delay measurements, we 
find that 



Pi - £ Vn [" e-*'dS^(t) + f pn (" ater*'dS<» ) (t), 

n-1 JO n = l JO 

Vk= t PnT^e-'dS^it), k> 1, 
„=i Jo fel 



where S ln) is the nth convolution of the service-time distribution, S, 
with itself. 
Now let 



G(x) = £ p n x n . 



1 



Equations (27) yield: 
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G(x) = L p m x m - £ x m £ p n r^^e-^dS^it) 

m=l m=l n=l ^0 W! 

IjZ p„ /"V"US<»>(<) 

n = l ./0 

= £ Pn f [ f ^-"e-'ldS<")(0 + x t p n c»(a) 

n=l Jo L "1=1 m - J n=l 

= £ Pn (* tr* 1 ^* - l)dS^(t) +X £ p„<r»(a) 

n = l ./o n=l 

= £ p n c7"[a(l - x)] - (1-x) £ p n <r»(a) 

n = 1 n = 1 

= G{,[«(1 - x)]} - (l-x)G[<r(«)]. 

Summing up, we have the relation 

(?(*) = G{«r[>(l - x)2) - (1 - x)G[a(a)]. (28) 

Note also that 

$(<*) = £p» I" (r-'dS^it) = (?[>(*) ]. 
l Jo 

Let x — <r(a) and x„ = a[a(l — x„_i)], n = 1, 2, 
Since £ a < 1, we have x < xi < • • • ^ 1 and lim„-.ooX n does 
therefore exist. With this notation, we obtain, from (28) : 



*(a) = G(x0 - (1 - x )$(a), 
G( Xl ) = (?(x 2 ) - (1 - x^(a), 



(29) 



(?(x„_i) = G(x„) - (1 - x„_i)*(a). 
Adding up these relations, we find that : 

*(«)Ti+ x: d-x m )] = G( Xn ), 

m=0 J 

and, by passing to the limit, 

(«)[l+ m f: o (I-*-)] = 1- (30) 



'!>( 



(Note that lim„-. 00 G(x n ) exists since the x„ constitute a positive 
monotone-increasing sequence bounded by 1 . By letting n — > °° in the 
last of the relations (29) it follows immediately that lim„-»x„ = 1 so 



* 
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long as *(«) ^ 0. This last condition is however clearly satisfied 
whenever a < 1.) 

In particular, when the service-times are negative exponential, we 
have: S(t) = 1 - er 1 , t ^ 0, <r(s) = (1 + s)~\ and (1 - x m ) = a m+1 / 
(1 + a + • • • + a m+1 ). Hence, by (30), 

[„,m+l "1-1 t v 

i+r l+a ;... +g >» J • (31) 

We examine briefly the case where the service-times have a gamma 
distribution with parameter k (the subscript k is added to the symbols 
considered earlier in order to stress their dependence on k). We have, 
in this case : 

a* (a) = [fe/(fc + «)]*• 

Then <r k (a) is a strictly decreasing function of fc(> 0) as can be seen 
by taking the derivative of In a^ l {a) = In (1 + a/k) k and using the 
inequality In (1 - x) > x/(l + x), x>0 (Ref. 7, p. 68). This 
monotonicity property of Ok implies that 

x M > Xk+h,o, Xhi > Xk+h.i,- ■ ■, k > 0, h > 0, 

and we have, therefore : 



£ (1 - x km ) < £ (1 - x k+h , m ), h > 0, 

m=0 m=Q 



so that 

and from (21) 



$ k (a) > $ k +h(ot), 

EW* k > EW^t+h, k>0. 

For k = 1 (exponential service-time) the conditional average delay 
for the delayed calls under the observed-last-served procedure is equal 
to the average length of the busy period. To see this one need only 
note that each positive observed delay begins with an arrival that 
occurs when there is exactly one customer in the system and ends 
when, for the first time thereafter, there is no waiting customer. 
Hence, for k = 1, the conditional delay distribution of the observed 
calls is the same as the busy-period distribution (Ref. 8, p. 32). Since 
the average length of the busy period and the average of the condi- 
tional waiting times of all the delayed calls are both equal to (1 - a) -1 
(Ref. 3, p. 63), we have 



EWi ^ EW*! a-ija) + a - 1 = 1 

a 1 - ci(a) [1 - <ri («)>(! - a) 1 - a ' 



(32) 
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where EW^i designates the average delay when the observed calls 
are served last. 

Clearly, the inequalities (22) and (23) imply that 

EW* k a k {a) + a - 1 

l-**-[l -a k {a)-\a{\ -a)' {6 *> 

We note that if the service-times have a gamma distribution with 
transform ak(s) = (k/k -+- s) k , then the conditional average delay on 
all delayed calls is given by the formula (Ref. 3, p. 50) : 



EW, 



fc+ 1 



' (34) 



2(1 - a) 

Substituting (k/k + a) k for Ck(a) in (33) we obtain the following 
upper bound for the conditional average delay of the observed delayed 
calls (this bound, as we know, is reached when the observed calls are 
served last) : 

[&/(* + «)]* + «- 1 (35) 



{1 -[*/(* + a) ]*}«(1 -a) 

Subtracting (34) from (35) we find that the difference is of the same 
sign as 

a ~ {1 - [k/(k + a)]*}[l + (k + l)a/2j (36) 

The two factors in the second term of (36) are both increasing func- 
tions of k and since (36) vanishes for k = 1 [a fact that we already 
know from (32)] we may conclude that (35) is smaller than (34) for 
< k < 1, and greater than (34) for k > 1. This proves that, for 
k < 1, the conditional average delay for all delayed calls is still larger 
than the conditional average delay for all observed delayed calls even if 
these calls are served last. For k > 1, the conditional average delay 
of the observed delayed calls for the observed-served-last procedure 
is larger than the conditional average delay of all the delayed calls. 

Expressions for the higher moments of the observed calls delay- 
distribution are also readily obtained. Let F be the equilibrium cumu- 
lative distribution of the virtual waiting time, V, at the conclusion 
of the measurement of a delay (this delay, of course, may be equal to 
zero). The delay distribution, K, of the calls whose delays are observed, 
can be readily expressed in terms of F. Indeed we have : 

K(w) = Pr IU\ £w] = aj" f X exp - a(y - t)-dF(y)\dt 

+ J%-°»dF(y). (37) 
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Table I — Means and Standard Deviations of the Delay Dis- 
tributions for All Calls and for All Observed 
Calls (1 Clock) in M/M/l— First- 
Come First-Served 



a 


EWi 


SWi 


EW*i 


8W*i 


0.1 


0.11111 


0.48432 


0.09259 


0.42264 


0.2 


0.25000 


0.75000 


0.17840 


0.58463 


0.3 


0.42857 


1.0202 


0.26684 


0.72360 


0.4 


0.66667 


1.3333 


0.36669 


0.87308 


0.5 


1.0000 


1.7321 


0.48958 


1.0590 


0.6 


1.5000 


2.2913 


0.65554 


1.3188 


0.7 


2.3333 


3.1798 


0.90754 


1.7310 


0.8 


4.0000 


4.8990 


1.3659 


2.5191 


0.9 


9.0000 


9.9499 


2.5808 


4.7519 


0.92 


11.500 


12.460 


3.1398 


5.8266 


0.94 


15.667 


16.637 


4.0284 


7.5793 


0.96 


24.000 


24.980 


5.6974 


10.987 


0.98 


49.000 


49.990 


10.243 


20.776 


0.99 


99.000 


99.995 


18.393 


39.434 



The problem of finding the distribution K of the observed delays 
is thus reduced to the problem of finding F. The distribution F satisfies 
the following integral equation: 

._„„(«2/) n 



n=i Jo Jo ni 



+ 



f dS(u) J~e-*»dF(y), (38) 



where £ (n) stands for the nth convolution of S with itself. Equation 
(38) follows immediately upon noticing that, at the conclusion of a 
measurement, the only calls in the system are (i) the call whose delay 
has just been measured and (ii) all the calls which arrived during the 
measurement interval (we note that the preceding argument makes 
essential use of the assumption that calls are served in order of arrival). 
Let <p and a be respectively the Laplace-Stieltjes transform of F 
and S. Then transforming (38) we obtain 

__„,, («y) w 



*dF(y) 



<p(s) = f *•(«) re-°»^dF(y) + <r(s) f° 

n=l JO "i JO 

= p{a[l - cr(s)]} + *(a)|>(a) - 1]. (39) 

This formula may be used to derive recurrence relations for the 
moments of V. Let n\fi n = EV n and 

d 



n\v n = (- l) n ^*00 



= f t n dS(t), 

t-0 Jo 
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Fig. 1 — Average delay for M/M/l vs occupancy. 



1.0 



so that n\v n is the nth moment of the service-time distribution. Using 
Faa di Bruno's formula for the derivative of a composite function 
(Ref. 9, p. 36) we find that: 



Mn=E 



fc! 



nifit k v\ l - "vfc + $(a)v», 



(40) 



with k = ki + • • • + fc„ and the sum over all solutions in non-negative 
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Fig. 2 — Conditional average delay for M/M/l vs occupancy. 



integers of k\ + 2fc 2 -\ \- nk n = n (note that v t = 1 since the 

average service-time is assumed here to be equal to 1). 
In particular, for n = 1,2, and 3 we have : 

Ml = EV = *(«)/(l -a), 
EV 2 



M2 - 



2! 



= *(aW(l -a)(l -« 2 ), 



M3 = IT = $W ^ + " 3(1 " a2) J /(1 " a)(1 ~ a2)(1 " a3) ' 

where $(a) is the probability that an observed call is not delayed. 

When the service-time distribution is exponential (with mean 1) 
we have v, : = 1 , i = 0, 1 , ■ ■ • , and (40) becomes : 
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Fig. 3 — Average delay for M/D/l vs occupancy. 



Mn = Z 



fcl 



Hkd k + *i(a) 






n > 1. 



Equation (37) may be used to express the moments of K in terms of 
the moments of V. We have: 

roo reo rx 

EW% = I w n dK{w) = a w n exp - a(y - w)dF(y)dw, 

J0 J0 Jw 

and, upon integrating by parts, we obtain 

EW„ - EV - -[1 - $(«)], 
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and 



Fig. 4 — Conditional average delay for M/D/l vs occupancy. 

n+ 1 



EWl +1 = EV n+l - 



EWl, 



n > 0. 



Thus, in particular, we have 

$(a) + a — 1 



* a(l — a) 



EW% = 



2a 2 $(a)v2 - 2[$(a) + a - 1](1 - a 2 ) 
<* 2 (1 -a)(l -a 2 ) 



The moments of W t depend only on the moments of the service-time 
distribution and on $(a). 

As a numerical illustration of the biases induced when only one 
clock is available, the means and the standard deviations of W\ and 
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a 

Fig. 5 — Conditional average delay for M/T}/1 vs occupancy. 



W+i are given in Table I. (The standard deviations of W\ and W+i 
are designated by SW\ and SW+x respectively.) For further quanti- 
tative results, see Figs. 1-6. 



VI. THE SINGLE-SERVER QUEUE M/M/l 

In this section we consider a single-server delay system and assume 
that: (i) calls arrive in a Poisson process of intensity a; (ii) the 
service-times are independent random variables with the same negative 
exponential distribution ; and (Hi) calls are served in order of arrival. 
We again suppose that only one delay can be measured at a time. Our 
purpose here is to derive the delay distribution of the observed calls. 

Let F(-) be the equilibrium cumulative distribution of the virtual 
waiting time at the conclusion of the measurement of a delay; the 
measured delay may of course be equal to zero. From (38), the distri- 
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Fig. 6— Conditional delay distributions for M/M/l -a = 0.5 first-come first-served. 

bution F(-) satisfies the following integral equation: 
„_„„ (<*y)» u"" 1 



F(t) - f'\ r± 

JO [ Jo n=l 



e -ay 



n\ (n-1)! 



r ,e-"dF(y)\du 



+ 



I e~ u du- f e-""dF(y). 
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This relation implies that F(-) is continuous and that the virtual 
waiting time, at the conclusion of a measurement, has a density func- 
tion /( ■ ) [at t = 0, the latter is defined as the right-hand derivative 
of F(-)]- We have therefore 

/(*) = e~' f K f(y)e-» f ^ f^W 'fr + e "' f °° f(v)r«dy 

Jo n =i nl {n — L)l Jo 

= ah-H-* J" f(y)e- a ^h[2 (ayt)^dy + e~ l J" f(y)e-<"<dy, (41) 

where h(') is the modified Bessel function of order 1 (Ref. 7, p. 374). 
The preceding relation implies that /( • ) is of the form 

/(*) = A(0 + «r', 

where c is a constant. Substitution of this expression in (41) yields, on 
taking relation 29.3.81, p. 1026, of Ref. 7 into account: 

hit) = ah-V J Q h{y)e-"^h[_2{ayty-]dy + ( ^_ **'««>. (42) 

Thus /i ( • ) is of the form 

A(0 = Mt) + (1 + a y exp - t/{\ + a), 

and substituting this expression in (42) we find that /s(') is of the form 

M) = /•«> + (1+ C a a2 +a2) 2 exp - «/(l + « + a 2 ). 

Proceeding in this manner, we define successively ft ( • ), /e( • ), ■ ■ ■ , and 
it is readily shown, by induction, that : 

f m (t) = f m+ i(t) + 



(1 + a + a 2 + ■ ■ • + a m ) 2 
•exp - 1/(1 + a +• ••+«"*), ?7i = 0, 1, 2, •••; 

/„(•)= /(•)• (43) 
Passing to the limit, we obtain, in this manner : 

f{t) = Ut)+c± 



=o (1 +a+---+a m ) 2 

■exp - t/{\ + a -\ \- a' 



where /*(•) = lim m -» x / m (-) satisfies the integral equation: 

U(t) = a*e-'H r U{y)e-"^h[2(ayt)^dy. (44) 

Jo 
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Note that, by virtue of (43), f m (t) > f m+ i(t) for all t and that 
lim TO -oo/m(0 does therefore exist and is non-negative since f m > for 
all m. 

We shall prove now that the only non-negative solution of (44) 

is U(t) = 0. 

Let 



6(s) = J™ Utfe-'dt. 



Then, transforming the previous relation, we have by Ref. 7, p. 1026, 
equation 29.3.81 : 



dt 



6(s) = a* f e-'e-'tr* j" U(y)e- av y i Iil2(ayt)qdy 

= a* f U(y)e- av y i (<xy)~ i (<r vll+ ' - i)dy 
= f U(y) exp - ayll - ^+- s ) d y 

/■CO 

- / U(V) exp (- ay)dy 

Setting s equal to zero in the preceding relation, we find that 0(a) = 
which implies that f^it) m and we have, therefore, for exponential 
service-times : 

" a m 

f{t) = ° A (l+«+--- + a»0 2 



1 - F(t) = c Z 



•exp -t/(l +« + ••• + «"*), t^O, 
a m (45) 



fto 1 + a + • • • + ot m 

•exp -t/{\ + «+•••+<**), « ^ 0, 
/(«) = F{t) = 0, « < 0, 

where the constant c is determined by the condition F(0) = 0. 

By means of (37) and (45), it is readily seen that the conditional 
delay distribution is given by the following formula : 

Pr [W+ ^ t\ observed call delayed] = c' £ rq: — _i r «,+i 

•exp - t/{\ + «+••• + «"), < >0, 
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where c' is determined by the requirement that the preceding expres- 
sion be equal to 1 for t = 0. 

The effect of partial sampling on the delay distribution is illustrated 
in Fig. 6. 

VII. THE MULTISERVER QUEUE M/M/s 

In this section we consider a full-access multiserver delay system 
with Poisson arrivals and exponential service-times. Our purpose here 
is to determine the probability <i>{ s) that an observed call is served 
without delay and the expected delay EW$\ of the observed calls 
(<l>(i> = $ h w$i = W+i). This is easily done. Indeed, under the present 
assumptions, A[ s) the expected number of nonobserved calls during 
the measurement of a positive delay is : 

S(l - $i) 

where EW+i and $i pertain to the single-server queue with load a/s. 
Equation (46) is an immediate consequence of the fact that in an 
s-server system with demand rate a and service rate 1, the conditional 
average delay of an observed call, EW$\/(1 — <£>{ s) ), is equal to the 
conditional average delay of an observed call in a single-server queue 
with offered load a/s and service rate s, i.e., EW+\/s{\ — *i). 
Hence, by (46) and (1) we have 

__ (1 - B)[_(a/s)EW ifX + 1 - $j] 

91 ~ (1 - B)l(a/s)EW^ + 1 - *i] + B(l - *0 

so that, by (17), 

BEW*i 



EW$\ = 



s(l - #0 +a(l - B)EW+ X 
We note that 

EW% = (1 - a/s)EW^ 

EWP 1 - *i + (1 - B)(a/s)EW*i 

For a/s fixed, the blocking probability B is strictly decreasing and 
tends to as s increases. Hence 

EWft EW& m > 

EW[ S) EW{ s + m) ' ' 

and 

\EW*\ EW£\ (1 - a/8)EW*i 

[ EW \ x " .™ EW[*> 1 - $i + (a/8)EW*i 
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We stress that the preceding relations are valid regardless of the order 
of service. 

Numerical values are given in Table II. They show, in particular, 
that, for a given server occupancy, the magnitudes of the relative 
biases become larger as the number of servers, s, increases but remain 
bounded. 

VIII. AN INEQUALITY FOR GI/M/s 

For the M/G/l queue we have seen that the average delay on all 
calls, EW , is always larger than the expected delay EW* even if the 
observed calls are served last. It will be shown here that the same 
relation also holds for the multiserver queue GI/M/s. 

When the observed calls are served last, the waiting times of the 
observed delayed calls have the same distribution as the busy period 
whenever the service-times are exponential. Writing EWt\ for the 
unconditional average delay for the observed-served-last procedure 
we have therefore: 

EW<& = (1 - *)/s(l - b), (47) 

where b is the root of smallest absolute value of the equation (Ref. 10, 
p. 225 ff.) 

z = A*(l - z) 

and A* is the Laplace-Stieltjes transform of the interarrival distri- 
bution A. We note here that b is also the blocking probability in the 
associated single-server queue GI/M/1 with A as interarrival distri- 
bution and exponential service-time distribution with mean 1/s. 
By means of (1) we can rewrite (47) as follows : 

EWfA = B/s(l - 6)[(1 - B)(l + A) + 5], (48) 

where B is the probability of delay in the GI/M/s queue. 

When the observed calls are served last, 1 + A is equal to the 
expected number of calls served during a busy period of GI/M/s 
which, in turn, is equal to the expected number of calls served during 
a busy period of the associated single-server queue GI/M/1. Hence, 
we have (Ref. 10, p. 286) : 

1 + A = (1 - b)~\ 
and on taking this relation into account, (48) reduces to 
EW% = B/s£l - B + 5(1 - &)]. 
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But the average delay for all calls is given by the formula (Ref. 11, 

p. 383) : 

BWF = B/s(l - b) 
so that 

EW ft < EWi'K 
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